Blog Annotation: From Corpus Analysis to Automatic Tag Suggestion
نویسندگان
چکیده
منابع مشابه
Blog Annotation: From Corpus Analysis to Automatic Tag Suggestion
Nowadays, blogs cover a large audience and they become part of mainstream media. Tags and categories are structural elements of a blog post intended to increase a blog’s visibility and enhance navigation and searching. We suppose that those annotations are made on subjective grounds rather than in a systematic way. This paper presents a 11 million words corpus of blogs posts in French dedicated...
متن کاملTagAssist: Automatic Tag Suggestion for Blog Posts
In this paper, we describe a system called TagAssist that provides tag suggestions for new blog posts by utilizing existing tagged posts. The system is able to increase the quality of suggested tags by performing lossless compression over existing tag data. In addition, the system employs a set of metrics to evaluate the quality of a potential tag suggestion. Coupled with the ability for users ...
متن کاملAutomatic Tag Suggestion Based on Resource Contents
In this paper, we present an automatic tag suggester, Tess. Our system makes recommendations based only on the textual contents of the resource and is independent of existing tags, thus allowing the emergence of novel tags. Preliminary evaluation experiments show that the system is not only able to suggest many useful tags, but also to discover new and relevant tags, not suggested by any of the...
متن کاملPSG hybrid approach to automatic corpus annotation
This paper describes and evaluates a hybrid non-probabilistic parsing method for the grammatical annotation of large corpora and the live analysis of teaching sentences, employing a layered scheme of lexiconand contextbased Constraint Grammars on the one hand, and Phrase Structure Grammars or syntactic bracketing algorithms on the other. The method has been fully implemented by the author for D...
متن کاملOptimal Tag Sets for Automatic Image Annotation
In this paper we introduce the Beam Search CRM (BS-CRM) model. This model implements two novel improvements to the basic CRM [2]. First, we argue that using a Minkowski kernel allows us to capture the covariance of visual features more effectively than the standard Gaussian kernel. Second, we advocate a procedure that selects the most informative subset of tags as the image annotation. Our proc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Research in Computing Science
سال: 2016
ISSN: 1870-4069
DOI: 10.13053/rcs-110-1-8